Recent Progress in the CUHK Dysarthric Speech Recognition System
نویسندگان
چکیده
Despite the rapid progress of automatic speech recognition (ASR) technologies in past few decades, disordered remains a highly challenging task to date. Disordered presents wide spectrum challenges current data intensive deep neural networks (DNNs) based ASR that predominantly target normal speech. This paper recent research efforts at Chinese University Hong Kong (CUHK) improve performance systems on largest publicly available UASpeech dysarthric corpus. A set novel modelling techniques including architectural search, augmentation using spectra-temporal perturbation, model speaker adaptation and cross-domain generation visual features within an audio-visual (AVSR) system framework were employed address above challenges. The combination these produced lowest published word error rate (WER) 25.21% test 16 speakers, overall WER reduction 5.4% absolute (17.6% relative) over CUHK 2018 featuring 6-way DNN cross out-of-domain trained systems. Bayesian further allows individual speakers be performed as little 3.06 seconds efficacy demonstrated CUDYS Cantonese task.
منابع مشابه
Recent Progress in the Sphinx Speech Recognition System
This paper describes recent improvements in the SPHINX Speech Recognition System. These enhancements include function-phrase modeling, between-word coarticulation modeling, and corrective training. On the DARPA resource management task, SPHINX attained a speaker-independent word accuracy of 96% with a grammar (perplexity 60), and 82% without grammar (perplexity 997).
متن کاملSpeech Recognition Technology for Dysarthric Speech
The initial results of investigations into the use of current commercial automatic speech recognition (ASR) software by people with speech disability (dysarthria) is presented, together with a brief summary of the history of the development of ASR and its applications for the disabled. Results confirm the viability of dysarthric use, identify areas of further investigation for improved recognit...
متن کاملRecent Progress in Robust Vocabulary-Independent Speech Recognition
This paper reports recent efforts to improve the performance of CMU's robust vocabulary-independent (VI) speech recognition systems on the DARPA speaker-independent resource management task. The improvements are evaluated on 320 sentences that randomly selected from the DARPA June 88, February 89 and October 89 test sets. Our first improvement involves more detailed acoustic modeling. We incorp...
متن کاملRecent Progress in Corpus-Based Spontaneous Speech Recognition
This paper overviews recent progress in the development of corpus-based spontaneous speech recognition technology. Although speech is in almost any situation spontaneous, recognition of spontaneous speech is an area which has only recently emerged in the field of automatic speech recognition. Broadening the application of speech recognition depends crucially on raising recognition performance f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE/ACM transactions on audio, speech, and language processing
سال: 2022
ISSN: ['2329-9304', '2329-9290']
DOI: https://doi.org/10.1109/taslp.2021.3091805